chip seq data Search Results


86
Epigenomics ag chip seq data
Overview of Deep5hmC. ( A ) The training set of Deep5hmC can be derived from matched 5hmC-seq and other epigenetic data such as <t>histone</t> <t>ChIP-seq</t> or DNase-seq/ATAC-seq from one condition. Specifically, the 5hmC-seq data can be collected from tissue-specific human tissues, which include bladder, brain, breast, heart, kidney, liver, lung, marrow, ovary (female), pancreas, placenta (female), prostate (male), colon (sigmoid), colon (transverse), skin, stomach, and testis (male). The matched tissue-specific epigenetic data, such as histone ChIP-seq data profiling histone modification and DNase-seq/ATAC-seq profiling chromatin accessibility, can be collected from public consortiums such as Roadmap Epigenomics and ENCODE. In this context, Deep5hmC aims to predict genome-wide 5hmC modification in a single condition. ( B ) The training set of Deep5hmC can also be derived from matched 5hmC-seq and ChIP-seq from a case–control study (e.g. Alzheimer’s disease (AD) versus healthy control) for predicting differentially hydroxymethylated regions (DhMRs). ( C ) Deep5hmC is a multimodal deep learning model to improve the prediction of tissue/cell type-specific genome-wide 5hmC modification by leveraging both DNA sequence and epigenetic features such as histone modification and chromatin accessibility. Deep5hmC consists of four modules, including Deep5hmC-binary, Deep5hmC-cont, Deep5hmC-gene, and Deep5hmC-diff. Specifically, Deep5hmC-binary takes the labeled 5hmC peaks and non-peaks as the training set to identify the 5hmC-enriched regions. Deep5hmC-cont takes the normalized read counts in 5hmC peaks and aims to predict the continuous 5hmC modification genome-wide. By leveraging Deep5hmC-cont, Deep5hmC-gene aggregates the predictions of Deep5hmC-cont in the gene bodies as the surrogate for the predicted gene expression. Different from Deep5hmC-binary, Deep5hmC-diff takes the labeled DhMRs/non-DhMRs in a case–control design of 5hmC-seq as the training set to predict genome-wide DhMRs and may discover de novo DhMRs. ( D ) Model architecture of Deep5hmC. Deep5hmC consists of both sequence modality and epigenetic modality consisting of their own convolutional neural networks (CNNs) to derive separate feature representations, which will be joined later via the multi-modal factorized bilinear (MFB) pooling fusion layer. The output of the MFB fusion layer will further connect to fully connected layers and the output layer afterward.
Chip Seq Data, supplied by Epigenomics ag, used in various techniques. Bioz Stars score: 86/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/chip seq data/product/Epigenomics ag
Average 86 stars, based on 1 article reviews
chip seq data - by Bioz Stars, 2026-03
86/100 stars
  Buy from Supplier

90
Epigenomics ag genome-wide chromatin immunoprecipitation sequencing data
Overview of Deep5hmC. ( A ) The training set of Deep5hmC can be derived from matched 5hmC-seq and other epigenetic data such as <t>histone</t> <t>ChIP-seq</t> or DNase-seq/ATAC-seq from one condition. Specifically, the 5hmC-seq data can be collected from tissue-specific human tissues, which include bladder, brain, breast, heart, kidney, liver, lung, marrow, ovary (female), pancreas, placenta (female), prostate (male), colon (sigmoid), colon (transverse), skin, stomach, and testis (male). The matched tissue-specific epigenetic data, such as histone ChIP-seq data profiling histone modification and DNase-seq/ATAC-seq profiling chromatin accessibility, can be collected from public consortiums such as Roadmap Epigenomics and ENCODE. In this context, Deep5hmC aims to predict genome-wide 5hmC modification in a single condition. ( B ) The training set of Deep5hmC can also be derived from matched 5hmC-seq and ChIP-seq from a case–control study (e.g. Alzheimer’s disease (AD) versus healthy control) for predicting differentially hydroxymethylated regions (DhMRs). ( C ) Deep5hmC is a multimodal deep learning model to improve the prediction of tissue/cell type-specific genome-wide 5hmC modification by leveraging both DNA sequence and epigenetic features such as histone modification and chromatin accessibility. Deep5hmC consists of four modules, including Deep5hmC-binary, Deep5hmC-cont, Deep5hmC-gene, and Deep5hmC-diff. Specifically, Deep5hmC-binary takes the labeled 5hmC peaks and non-peaks as the training set to identify the 5hmC-enriched regions. Deep5hmC-cont takes the normalized read counts in 5hmC peaks and aims to predict the continuous 5hmC modification genome-wide. By leveraging Deep5hmC-cont, Deep5hmC-gene aggregates the predictions of Deep5hmC-cont in the gene bodies as the surrogate for the predicted gene expression. Different from Deep5hmC-binary, Deep5hmC-diff takes the labeled DhMRs/non-DhMRs in a case–control design of 5hmC-seq as the training set to predict genome-wide DhMRs and may discover de novo DhMRs. ( D ) Model architecture of Deep5hmC. Deep5hmC consists of both sequence modality and epigenetic modality consisting of their own convolutional neural networks (CNNs) to derive separate feature representations, which will be joined later via the multi-modal factorized bilinear (MFB) pooling fusion layer. The output of the MFB fusion layer will further connect to fully connected layers and the output layer afterward.
Genome Wide Chromatin Immunoprecipitation Sequencing Data, supplied by Epigenomics ag, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/genome-wide chromatin immunoprecipitation sequencing data/product/Epigenomics ag
Average 90 stars, based on 1 article reviews
genome-wide chromatin immunoprecipitation sequencing data - by Bioz Stars, 2026-03
90/100 stars
  Buy from Supplier

90
Biotechnology Information histone modification chip-seq data sets
Overview of Deep5hmC. ( A ) The training set of Deep5hmC can be derived from matched 5hmC-seq and other epigenetic data such as <t>histone</t> <t>ChIP-seq</t> or DNase-seq/ATAC-seq from one condition. Specifically, the 5hmC-seq data can be collected from tissue-specific human tissues, which include bladder, brain, breast, heart, kidney, liver, lung, marrow, ovary (female), pancreas, placenta (female), prostate (male), colon (sigmoid), colon (transverse), skin, stomach, and testis (male). The matched tissue-specific epigenetic data, such as histone ChIP-seq data profiling histone modification and DNase-seq/ATAC-seq profiling chromatin accessibility, can be collected from public consortiums such as Roadmap Epigenomics and ENCODE. In this context, Deep5hmC aims to predict genome-wide 5hmC modification in a single condition. ( B ) The training set of Deep5hmC can also be derived from matched 5hmC-seq and ChIP-seq from a case–control study (e.g. Alzheimer’s disease (AD) versus healthy control) for predicting differentially hydroxymethylated regions (DhMRs). ( C ) Deep5hmC is a multimodal deep learning model to improve the prediction of tissue/cell type-specific genome-wide 5hmC modification by leveraging both DNA sequence and epigenetic features such as histone modification and chromatin accessibility. Deep5hmC consists of four modules, including Deep5hmC-binary, Deep5hmC-cont, Deep5hmC-gene, and Deep5hmC-diff. Specifically, Deep5hmC-binary takes the labeled 5hmC peaks and non-peaks as the training set to identify the 5hmC-enriched regions. Deep5hmC-cont takes the normalized read counts in 5hmC peaks and aims to predict the continuous 5hmC modification genome-wide. By leveraging Deep5hmC-cont, Deep5hmC-gene aggregates the predictions of Deep5hmC-cont in the gene bodies as the surrogate for the predicted gene expression. Different from Deep5hmC-binary, Deep5hmC-diff takes the labeled DhMRs/non-DhMRs in a case–control design of 5hmC-seq as the training set to predict genome-wide DhMRs and may discover de novo DhMRs. ( D ) Model architecture of Deep5hmC. Deep5hmC consists of both sequence modality and epigenetic modality consisting of their own convolutional neural networks (CNNs) to derive separate feature representations, which will be joined later via the multi-modal factorized bilinear (MFB) pooling fusion layer. The output of the MFB fusion layer will further connect to fully connected layers and the output layer afterward.
Histone Modification Chip Seq Data Sets, supplied by Biotechnology Information, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/histone modification chip-seq data sets/product/Biotechnology Information
Average 90 stars, based on 1 article reviews
histone modification chip-seq data sets - by Bioz Stars, 2026-03
90/100 stars
  Buy from Supplier

90
Broad Institute Inc chip-sequencing (chip-seq) platforms
Overview of Deep5hmC. ( A ) The training set of Deep5hmC can be derived from matched 5hmC-seq and other epigenetic data such as <t>histone</t> <t>ChIP-seq</t> or DNase-seq/ATAC-seq from one condition. Specifically, the 5hmC-seq data can be collected from tissue-specific human tissues, which include bladder, brain, breast, heart, kidney, liver, lung, marrow, ovary (female), pancreas, placenta (female), prostate (male), colon (sigmoid), colon (transverse), skin, stomach, and testis (male). The matched tissue-specific epigenetic data, such as histone ChIP-seq data profiling histone modification and DNase-seq/ATAC-seq profiling chromatin accessibility, can be collected from public consortiums such as Roadmap Epigenomics and ENCODE. In this context, Deep5hmC aims to predict genome-wide 5hmC modification in a single condition. ( B ) The training set of Deep5hmC can also be derived from matched 5hmC-seq and ChIP-seq from a case–control study (e.g. Alzheimer’s disease (AD) versus healthy control) for predicting differentially hydroxymethylated regions (DhMRs). ( C ) Deep5hmC is a multimodal deep learning model to improve the prediction of tissue/cell type-specific genome-wide 5hmC modification by leveraging both DNA sequence and epigenetic features such as histone modification and chromatin accessibility. Deep5hmC consists of four modules, including Deep5hmC-binary, Deep5hmC-cont, Deep5hmC-gene, and Deep5hmC-diff. Specifically, Deep5hmC-binary takes the labeled 5hmC peaks and non-peaks as the training set to identify the 5hmC-enriched regions. Deep5hmC-cont takes the normalized read counts in 5hmC peaks and aims to predict the continuous 5hmC modification genome-wide. By leveraging Deep5hmC-cont, Deep5hmC-gene aggregates the predictions of Deep5hmC-cont in the gene bodies as the surrogate for the predicted gene expression. Different from Deep5hmC-binary, Deep5hmC-diff takes the labeled DhMRs/non-DhMRs in a case–control design of 5hmC-seq as the training set to predict genome-wide DhMRs and may discover de novo DhMRs. ( D ) Model architecture of Deep5hmC. Deep5hmC consists of both sequence modality and epigenetic modality consisting of their own convolutional neural networks (CNNs) to derive separate feature representations, which will be joined later via the multi-modal factorized bilinear (MFB) pooling fusion layer. The output of the MFB fusion layer will further connect to fully connected layers and the output layer afterward.
Chip Sequencing (Chip Seq) Platforms, supplied by Broad Institute Inc, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/chip-sequencing (chip-seq) platforms/product/Broad Institute Inc
Average 90 stars, based on 1 article reviews
chip-sequencing (chip-seq) platforms - by Bioz Stars, 2026-03
90/100 stars
  Buy from Supplier

90
Illumina Inc chip-seq data for 182 mature neutrophil sample(s)
a Schematic of CRD associations, including CRD-QTLs, CRD-gene, and CRD-CRD, with the number of associations found within the maximum sample size available for each cell type and epigenomic mark. b Quantile-quantile (Q-Q) plots and genomic inflation factor ( λ metric) for hCRD-QTLs across 3 autoimmune diseases: Type 1 diabetes (DT1), rheumatoid arthritis (RA), multiple sclerosis (MS) and Type 2 diabetes (DT2) as negative control. c Odds ratios and standard errors of the effect size for the enrichment in hCRD-QTLs in monocytes(MON), neutrophils (NEU), T-cells (TCL) for eight blood cell count traits: Basophil count (BC), Eosinophil count (EC), Lymphocyte count (LC), Monocyte count (MC), <t>Neutrophil</t> count (NC), Platelet count (PC), Red blood cell count (RBC), and White blood cell count (WBC).
Chip Seq Data For 182 Mature Neutrophil Sample(S), supplied by Illumina Inc, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/chip-seq data for 182 mature neutrophil sample(s)/product/Illumina Inc
Average 90 stars, based on 1 article reviews
chip-seq data for 182 mature neutrophil sample(s) - by Bioz Stars, 2026-03
90/100 stars
  Buy from Supplier

90
Biotechnology Information rna-seq data of oe-ptrlbd39
a Schematic of CRD associations, including CRD-QTLs, CRD-gene, and CRD-CRD, with the number of associations found within the maximum sample size available for each cell type and epigenomic mark. b Quantile-quantile (Q-Q) plots and genomic inflation factor ( λ metric) for hCRD-QTLs across 3 autoimmune diseases: Type 1 diabetes (DT1), rheumatoid arthritis (RA), multiple sclerosis (MS) and Type 2 diabetes (DT2) as negative control. c Odds ratios and standard errors of the effect size for the enrichment in hCRD-QTLs in monocytes(MON), neutrophils (NEU), T-cells (TCL) for eight blood cell count traits: Basophil count (BC), Eosinophil count (EC), Lymphocyte count (LC), Monocyte count (MC), <t>Neutrophil</t> count (NC), Platelet count (PC), Red blood cell count (RBC), and White blood cell count (WBC).
Rna Seq Data Of Oe Ptrlbd39, supplied by Biotechnology Information, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/rna-seq data of oe-ptrlbd39/product/Biotechnology Information
Average 90 stars, based on 1 article reviews
rna-seq data of oe-ptrlbd39 - by Bioz Stars, 2026-03
90/100 stars
  Buy from Supplier

90
Solexa chip-seq data
a Schematic of CRD associations, including CRD-QTLs, CRD-gene, and CRD-CRD, with the number of associations found within the maximum sample size available for each cell type and epigenomic mark. b Quantile-quantile (Q-Q) plots and genomic inflation factor ( λ metric) for hCRD-QTLs across 3 autoimmune diseases: Type 1 diabetes (DT1), rheumatoid arthritis (RA), multiple sclerosis (MS) and Type 2 diabetes (DT2) as negative control. c Odds ratios and standard errors of the effect size for the enrichment in hCRD-QTLs in monocytes(MON), neutrophils (NEU), T-cells (TCL) for eight blood cell count traits: Basophil count (BC), Eosinophil count (EC), Lymphocyte count (LC), Monocyte count (MC), <t>Neutrophil</t> count (NC), Platelet count (PC), Red blood cell count (RBC), and White blood cell count (WBC).
Chip Seq Data, supplied by Solexa, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/chip-seq data/product/Solexa
Average 90 stars, based on 1 article reviews
chip-seq data - by Bioz Stars, 2026-03
90/100 stars
  Buy from Supplier

90
Biotechnology Information chip-seq data
a Schematic of CRD associations, including CRD-QTLs, CRD-gene, and CRD-CRD, with the number of associations found within the maximum sample size available for each cell type and epigenomic mark. b Quantile-quantile (Q-Q) plots and genomic inflation factor ( λ metric) for hCRD-QTLs across 3 autoimmune diseases: Type 1 diabetes (DT1), rheumatoid arthritis (RA), multiple sclerosis (MS) and Type 2 diabetes (DT2) as negative control. c Odds ratios and standard errors of the effect size for the enrichment in hCRD-QTLs in monocytes(MON), neutrophils (NEU), T-cells (TCL) for eight blood cell count traits: Basophil count (BC), Eosinophil count (EC), Lymphocyte count (LC), Monocyte count (MC), <t>Neutrophil</t> count (NC), Platelet count (PC), Red blood cell count (RBC), and White blood cell count (WBC).
Chip Seq Data, supplied by Biotechnology Information, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/chip-seq data/product/Biotechnology Information
Average 90 stars, based on 1 article reviews
chip-seq data - by Bioz Stars, 2026-03
90/100 stars
  Buy from Supplier

90
StemCells Inc chip-seq data
a Schematic of CRD associations, including CRD-QTLs, CRD-gene, and CRD-CRD, with the number of associations found within the maximum sample size available for each cell type and epigenomic mark. b Quantile-quantile (Q-Q) plots and genomic inflation factor ( λ metric) for hCRD-QTLs across 3 autoimmune diseases: Type 1 diabetes (DT1), rheumatoid arthritis (RA), multiple sclerosis (MS) and Type 2 diabetes (DT2) as negative control. c Odds ratios and standard errors of the effect size for the enrichment in hCRD-QTLs in monocytes(MON), neutrophils (NEU), T-cells (TCL) for eight blood cell count traits: Basophil count (BC), Eosinophil count (EC), Lymphocyte count (LC), Monocyte count (MC), <t>Neutrophil</t> count (NC), Platelet count (PC), Red blood cell count (RBC), and White blood cell count (WBC).
Chip Seq Data, supplied by StemCells Inc, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/chip-seq data/product/StemCells Inc
Average 90 stars, based on 1 article reviews
chip-seq data - by Bioz Stars, 2026-03
90/100 stars
  Buy from Supplier

90
Clevergene Biocorp Pvt chip-seq data
a Schematic of CRD associations, including CRD-QTLs, CRD-gene, and CRD-CRD, with the number of associations found within the maximum sample size available for each cell type and epigenomic mark. b Quantile-quantile (Q-Q) plots and genomic inflation factor ( λ metric) for hCRD-QTLs across 3 autoimmune diseases: Type 1 diabetes (DT1), rheumatoid arthritis (RA), multiple sclerosis (MS) and Type 2 diabetes (DT2) as negative control. c Odds ratios and standard errors of the effect size for the enrichment in hCRD-QTLs in monocytes(MON), neutrophils (NEU), T-cells (TCL) for eight blood cell count traits: Basophil count (BC), Eosinophil count (EC), Lymphocyte count (LC), Monocyte count (MC), <t>Neutrophil</t> count (NC), Platelet count (PC), Red blood cell count (RBC), and White blood cell count (WBC).
Chip Seq Data, supplied by Clevergene Biocorp Pvt, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/chip-seq data/product/Clevergene Biocorp Pvt
Average 90 stars, based on 1 article reviews
chip-seq data - by Bioz Stars, 2026-03
90/100 stars
  Buy from Supplier

90
Omics Data Automation atac-seq data
a Schematic of CRD associations, including CRD-QTLs, CRD-gene, and CRD-CRD, with the number of associations found within the maximum sample size available for each cell type and epigenomic mark. b Quantile-quantile (Q-Q) plots and genomic inflation factor ( λ metric) for hCRD-QTLs across 3 autoimmune diseases: Type 1 diabetes (DT1), rheumatoid arthritis (RA), multiple sclerosis (MS) and Type 2 diabetes (DT2) as negative control. c Odds ratios and standard errors of the effect size for the enrichment in hCRD-QTLs in monocytes(MON), neutrophils (NEU), T-cells (TCL) for eight blood cell count traits: Basophil count (BC), Eosinophil count (EC), Lymphocyte count (LC), Monocyte count (MC), <t>Neutrophil</t> count (NC), Platelet count (PC), Red blood cell count (RBC), and White blood cell count (WBC).
Atac Seq Data, supplied by Omics Data Automation, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/atac-seq data/product/Omics Data Automation
Average 90 stars, based on 1 article reviews
atac-seq data - by Bioz Stars, 2026-03
90/100 stars
  Buy from Supplier

90
Epigenomics ag histone mark chip-seq read alignments
a Schematic of CRD associations, including CRD-QTLs, CRD-gene, and CRD-CRD, with the number of associations found within the maximum sample size available for each cell type and epigenomic mark. b Quantile-quantile (Q-Q) plots and genomic inflation factor ( λ metric) for hCRD-QTLs across 3 autoimmune diseases: Type 1 diabetes (DT1), rheumatoid arthritis (RA), multiple sclerosis (MS) and Type 2 diabetes (DT2) as negative control. c Odds ratios and standard errors of the effect size for the enrichment in hCRD-QTLs in monocytes(MON), neutrophils (NEU), T-cells (TCL) for eight blood cell count traits: Basophil count (BC), Eosinophil count (EC), Lymphocyte count (LC), Monocyte count (MC), <t>Neutrophil</t> count (NC), Platelet count (PC), Red blood cell count (RBC), and White blood cell count (WBC).
Histone Mark Chip Seq Read Alignments, supplied by Epigenomics ag, used in various techniques. Bioz Stars score: 90/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/histone mark chip-seq read alignments/product/Epigenomics ag
Average 90 stars, based on 1 article reviews
histone mark chip-seq read alignments - by Bioz Stars, 2026-03
90/100 stars
  Buy from Supplier

Image Search Results


Overview of Deep5hmC. ( A ) The training set of Deep5hmC can be derived from matched 5hmC-seq and other epigenetic data such as histone ChIP-seq or DNase-seq/ATAC-seq from one condition. Specifically, the 5hmC-seq data can be collected from tissue-specific human tissues, which include bladder, brain, breast, heart, kidney, liver, lung, marrow, ovary (female), pancreas, placenta (female), prostate (male), colon (sigmoid), colon (transverse), skin, stomach, and testis (male). The matched tissue-specific epigenetic data, such as histone ChIP-seq data profiling histone modification and DNase-seq/ATAC-seq profiling chromatin accessibility, can be collected from public consortiums such as Roadmap Epigenomics and ENCODE. In this context, Deep5hmC aims to predict genome-wide 5hmC modification in a single condition. ( B ) The training set of Deep5hmC can also be derived from matched 5hmC-seq and ChIP-seq from a case–control study (e.g. Alzheimer’s disease (AD) versus healthy control) for predicting differentially hydroxymethylated regions (DhMRs). ( C ) Deep5hmC is a multimodal deep learning model to improve the prediction of tissue/cell type-specific genome-wide 5hmC modification by leveraging both DNA sequence and epigenetic features such as histone modification and chromatin accessibility. Deep5hmC consists of four modules, including Deep5hmC-binary, Deep5hmC-cont, Deep5hmC-gene, and Deep5hmC-diff. Specifically, Deep5hmC-binary takes the labeled 5hmC peaks and non-peaks as the training set to identify the 5hmC-enriched regions. Deep5hmC-cont takes the normalized read counts in 5hmC peaks and aims to predict the continuous 5hmC modification genome-wide. By leveraging Deep5hmC-cont, Deep5hmC-gene aggregates the predictions of Deep5hmC-cont in the gene bodies as the surrogate for the predicted gene expression. Different from Deep5hmC-binary, Deep5hmC-diff takes the labeled DhMRs/non-DhMRs in a case–control design of 5hmC-seq as the training set to predict genome-wide DhMRs and may discover de novo DhMRs. ( D ) Model architecture of Deep5hmC. Deep5hmC consists of both sequence modality and epigenetic modality consisting of their own convolutional neural networks (CNNs) to derive separate feature representations, which will be joined later via the multi-modal factorized bilinear (MFB) pooling fusion layer. The output of the MFB fusion layer will further connect to fully connected layers and the output layer afterward.

Journal: Bioinformatics

Article Title: Deep5hmC: predicting genome-wide 5-hydroxymethylcytosine landscape via a multimodal deep learning model

doi: 10.1093/bioinformatics/btae528

Figure Lengend Snippet: Overview of Deep5hmC. ( A ) The training set of Deep5hmC can be derived from matched 5hmC-seq and other epigenetic data such as histone ChIP-seq or DNase-seq/ATAC-seq from one condition. Specifically, the 5hmC-seq data can be collected from tissue-specific human tissues, which include bladder, brain, breast, heart, kidney, liver, lung, marrow, ovary (female), pancreas, placenta (female), prostate (male), colon (sigmoid), colon (transverse), skin, stomach, and testis (male). The matched tissue-specific epigenetic data, such as histone ChIP-seq data profiling histone modification and DNase-seq/ATAC-seq profiling chromatin accessibility, can be collected from public consortiums such as Roadmap Epigenomics and ENCODE. In this context, Deep5hmC aims to predict genome-wide 5hmC modification in a single condition. ( B ) The training set of Deep5hmC can also be derived from matched 5hmC-seq and ChIP-seq from a case–control study (e.g. Alzheimer’s disease (AD) versus healthy control) for predicting differentially hydroxymethylated regions (DhMRs). ( C ) Deep5hmC is a multimodal deep learning model to improve the prediction of tissue/cell type-specific genome-wide 5hmC modification by leveraging both DNA sequence and epigenetic features such as histone modification and chromatin accessibility. Deep5hmC consists of four modules, including Deep5hmC-binary, Deep5hmC-cont, Deep5hmC-gene, and Deep5hmC-diff. Specifically, Deep5hmC-binary takes the labeled 5hmC peaks and non-peaks as the training set to identify the 5hmC-enriched regions. Deep5hmC-cont takes the normalized read counts in 5hmC peaks and aims to predict the continuous 5hmC modification genome-wide. By leveraging Deep5hmC-cont, Deep5hmC-gene aggregates the predictions of Deep5hmC-cont in the gene bodies as the surrogate for the predicted gene expression. Different from Deep5hmC-binary, Deep5hmC-diff takes the labeled DhMRs/non-DhMRs in a case–control design of 5hmC-seq as the training set to predict genome-wide DhMRs and may discover de novo DhMRs. ( D ) Model architecture of Deep5hmC. Deep5hmC consists of both sequence modality and epigenetic modality consisting of their own convolutional neural networks (CNNs) to derive separate feature representations, which will be joined later via the multi-modal factorized bilinear (MFB) pooling fusion layer. The output of the MFB fusion layer will further connect to fully connected layers and the output layer afterward.

Article Snippet: For “Human Tissues,” we carefully select aligned bed files of ChIP-seq data from Roadmap Epigenomics by ensuring a match between ChIP-seq data and 5hmC-seq data based on tissue type.

Techniques: Derivative Assay, ChIP-sequencing, Modification, Genome Wide, Control, Sequencing, Labeling, Expressing

Distribution pattern of histone modification around 5hmC peaks. EB 5hmC peaks are collected from “Forebrain Organoid” 5hmC-seq data and ChIP-seq data in “Brain Angular Gyrus” from seven histone marks are collected from Roadmap Epigenomics. Histone features are obtained and averaged in the neighborhood of all 5hmC peaks for the positive and negative sets, respectively. Specifically, histone features are created by segmenting an extended genomic region of 10 kb both upstream and downstream of each 5hmC peak into 41 1 kb windows with a sliding size of 500 bp and counting reads for each 1 kb windows. For each histone mark, the Kolmogorov–Smirnov test is performed to test the distribution difference of histone features between positive and negative 5hmC peaks and the P -value is reported.

Journal: Bioinformatics

Article Title: Deep5hmC: predicting genome-wide 5-hydroxymethylcytosine landscape via a multimodal deep learning model

doi: 10.1093/bioinformatics/btae528

Figure Lengend Snippet: Distribution pattern of histone modification around 5hmC peaks. EB 5hmC peaks are collected from “Forebrain Organoid” 5hmC-seq data and ChIP-seq data in “Brain Angular Gyrus” from seven histone marks are collected from Roadmap Epigenomics. Histone features are obtained and averaged in the neighborhood of all 5hmC peaks for the positive and negative sets, respectively. Specifically, histone features are created by segmenting an extended genomic region of 10 kb both upstream and downstream of each 5hmC peak into 41 1 kb windows with a sliding size of 500 bp and counting reads for each 1 kb windows. For each histone mark, the Kolmogorov–Smirnov test is performed to test the distribution difference of histone features between positive and negative 5hmC peaks and the P -value is reported.

Article Snippet: For “Human Tissues,” we carefully select aligned bed files of ChIP-seq data from Roadmap Epigenomics by ensuring a match between ChIP-seq data and 5hmC-seq data based on tissue type.

Techniques: Modification, ChIP-sequencing

Comparison of unimodal and multimodal Deep5hmC for predicting binary 5hmC modification sites. When using histone modification in the epigenetic modality, two unimodal models of Deep5hmC: Deep5hmC-Seq using only DNA sequence as the model input and Deep5hmC-His using only histone modification as the model input are compared to the default multimodal Deep5hmC-Seq+His using both DNA sequence and histone modification as the model input. 5hmC peaks from the EB stage “Forebrain Organoid” and two histone marks: H3K4me1 and H3K4me3 ChIP-seq data in all brain regions from Roadmap Epigenomics are used as the training set. ( A ) AUROC reported for three compared methods. ( B ) AUPRC reported three compared methods.

Journal: Bioinformatics

Article Title: Deep5hmC: predicting genome-wide 5-hydroxymethylcytosine landscape via a multimodal deep learning model

doi: 10.1093/bioinformatics/btae528

Figure Lengend Snippet: Comparison of unimodal and multimodal Deep5hmC for predicting binary 5hmC modification sites. When using histone modification in the epigenetic modality, two unimodal models of Deep5hmC: Deep5hmC-Seq using only DNA sequence as the model input and Deep5hmC-His using only histone modification as the model input are compared to the default multimodal Deep5hmC-Seq+His using both DNA sequence and histone modification as the model input. 5hmC peaks from the EB stage “Forebrain Organoid” and two histone marks: H3K4me1 and H3K4me3 ChIP-seq data in all brain regions from Roadmap Epigenomics are used as the training set. ( A ) AUROC reported for three compared methods. ( B ) AUPRC reported three compared methods.

Article Snippet: For “Human Tissues,” we carefully select aligned bed files of ChIP-seq data from Roadmap Epigenomics by ensuring a match between ChIP-seq data and 5hmC-seq data based on tissue type.

Techniques: Comparison, Modification, Sequencing, ChIP-sequencing

a Schematic of CRD associations, including CRD-QTLs, CRD-gene, and CRD-CRD, with the number of associations found within the maximum sample size available for each cell type and epigenomic mark. b Quantile-quantile (Q-Q) plots and genomic inflation factor ( λ metric) for hCRD-QTLs across 3 autoimmune diseases: Type 1 diabetes (DT1), rheumatoid arthritis (RA), multiple sclerosis (MS) and Type 2 diabetes (DT2) as negative control. c Odds ratios and standard errors of the effect size for the enrichment in hCRD-QTLs in monocytes(MON), neutrophils (NEU), T-cells (TCL) for eight blood cell count traits: Basophil count (BC), Eosinophil count (EC), Lymphocyte count (LC), Monocyte count (MC), Neutrophil count (NC), Platelet count (PC), Red blood cell count (RBC), and White blood cell count (WBC).

Journal: Communications Biology

Article Title: Genetic variation in cis-regulatory domains suggests cell type-specific regulatory mechanisms in immunity

doi: 10.1038/s42003-023-04688-3

Figure Lengend Snippet: a Schematic of CRD associations, including CRD-QTLs, CRD-gene, and CRD-CRD, with the number of associations found within the maximum sample size available for each cell type and epigenomic mark. b Quantile-quantile (Q-Q) plots and genomic inflation factor ( λ metric) for hCRD-QTLs across 3 autoimmune diseases: Type 1 diabetes (DT1), rheumatoid arthritis (RA), multiple sclerosis (MS) and Type 2 diabetes (DT2) as negative control. c Odds ratios and standard errors of the effect size for the enrichment in hCRD-QTLs in monocytes(MON), neutrophils (NEU), T-cells (TCL) for eight blood cell count traits: Basophil count (BC), Eosinophil count (EC), Lymphocyte count (LC), Monocyte count (MC), Neutrophil count (NC), Platelet count (PC), Red blood cell count (RBC), and White blood cell count (WBC).

Article Snippet: Data from the European Genome-Phenome Archive https://ega-archive.org/datasets : EGAD00001002663 Illumina HiSeq 2000, 193 samples EGAD00010000850 DNA methylation profiles of monocytes, neutrophils and T cells from 525 healthy donors EGAD00001002675 RNA-Seq data for 205 mature neutrophil sample(s) EGAD00001002670 ChIP-Seq data for 182 mature neutrophil sample(s) EGAD00001002671 RNA-Seq data for 212 CD4-positive, alpha-beta T cell sample(s) EGAD00001002673 ChIP-Seq data for 154 CD4-positive, alpha-beta T cell sample(s) EGAD00001002672 ChIP-Seq data for 172 CD14-positive, CD16-negative classical monocyte sample(s) EGAD00001002674 RNA-Seq data for 197 CD14-positive, CD16-negative classical monocyte sample(s).

Techniques: Negative Control, Cell Counting

a Fraction of neutrophil chromatin peak pairs on the same chromosome supported by PCHi-C data (CHiCAGO score ≥ 5) at significantly associated (pink) and non-associated (blue) pairs of chromatin peaks within bins of increasing distance between peaks. b Fraction of hCRD-gene and mCRD-gene associations supported by PCHi-C data (CHiCAGO score ≥ 5) at increasing CRD-gene distances. c Fraction of hCRD-gene associations and mCRD-gene associations supported by PCHi-C data (mean CHiCAGO score ≥ 5) for pairs of co-expressed genes (5%FDR) that associate with the same CRD. The fraction is measured at bins of increasing distance between co-expressed genes.

Journal: Communications Biology

Article Title: Genetic variation in cis-regulatory domains suggests cell type-specific regulatory mechanisms in immunity

doi: 10.1038/s42003-023-04688-3

Figure Lengend Snippet: a Fraction of neutrophil chromatin peak pairs on the same chromosome supported by PCHi-C data (CHiCAGO score ≥ 5) at significantly associated (pink) and non-associated (blue) pairs of chromatin peaks within bins of increasing distance between peaks. b Fraction of hCRD-gene and mCRD-gene associations supported by PCHi-C data (CHiCAGO score ≥ 5) at increasing CRD-gene distances. c Fraction of hCRD-gene associations and mCRD-gene associations supported by PCHi-C data (mean CHiCAGO score ≥ 5) for pairs of co-expressed genes (5%FDR) that associate with the same CRD. The fraction is measured at bins of increasing distance between co-expressed genes.

Article Snippet: Data from the European Genome-Phenome Archive https://ega-archive.org/datasets : EGAD00001002663 Illumina HiSeq 2000, 193 samples EGAD00010000850 DNA methylation profiles of monocytes, neutrophils and T cells from 525 healthy donors EGAD00001002675 RNA-Seq data for 205 mature neutrophil sample(s) EGAD00001002670 ChIP-Seq data for 182 mature neutrophil sample(s) EGAD00001002671 RNA-Seq data for 212 CD4-positive, alpha-beta T cell sample(s) EGAD00001002673 ChIP-Seq data for 154 CD4-positive, alpha-beta T cell sample(s) EGAD00001002672 ChIP-Seq data for 172 CD14-positive, CD16-negative classical monocyte sample(s) EGAD00001002674 RNA-Seq data for 197 CD14-positive, CD16-negative classical monocyte sample(s).

Techniques: